Overview
Brought to you by YData
Dataset statistics
| Number of variables | 7 |
|---|---|
| Number of observations | 43022 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 177 |
| Duplicate rows (%) | 0.4% |
| Total size in memory | 17.1 MiB |
| Average record size in memory | 417.6 B |
Variable types
| Numeric | 1 |
|---|---|
| Text | 2 |
| Categorical | 3 |
| DateTime | 1 |
| Dataset has 177 (0.4%) duplicate rows | Duplicates |
Event is highly overall correlated with Subsystem | High correlation |
Subsystem is highly overall correlated with Event | High correlation |
Unknown is highly imbalanced (82.2%) | Imbalance |
Reproduction
| Analysis started | 2025-03-19 12:37:40.971366 |
|---|---|
| Analysis finished | 2025-03-19 12:39:35.995389 |
| Duration | 1 minute and 55.02 seconds |
| Software version | ydata-profiling vv4.14.0 |
| Download configuration | config.json |
Variables
ID
Real number (ℝ)
| Distinct | 41739 |
|---|---|
| Distinct (%) | 97.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 460027.3 |
| Minimum | 1 |
|---|---|
| Maximum | 2616645 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 672.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 34428 |
| Q1 | 117920.75 |
| median | 255915.5 |
| Q3 | 379962.5 |
| 95-th percentile | 2583662.9 |
| Maximum | 2616645 |
| Range | 2616644 |
| Interquartile range (IQR) | 262041.75 |
Descriptive statistics
| Standard deviation | 716577.04 |
|---|---|
| Coefficient of variation (CV) | 1.5576837 |
| Kurtosis | 4.6987662 |
| Mean | 460027.3 |
| Median Absolute Deviation (MAD) | 130051.5 |
| Skewness | 2.5255952 |
| Sum | 1.9791295 × 1010 |
| Variance | 5.1348266 × 1011 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 56279 | 4 | < 0.1% |
| 32716 | 3 | < 0.1% |
| 56375 | 3 | < 0.1% |
| 206869 | 3 | < 0.1% |
| 256673 | 3 | < 0.1% |
| 272781 | 3 | < 0.1% |
| 56340 | 3 | < 0.1% |
| 272743 | 3 | < 0.1% |
| 56365 | 3 | < 0.1% |
| 432918 | 3 | < 0.1% |
| Other values (41729) | 42991 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 7 | 1 | |
| 14 | 1 | |
| 19 | 1 | |
| 23 | 1 | |
| 51 | 1 | |
| 55 | 1 | |
| 56 | 1 | |
| 58 | 1 | |
| 59 | 1 |
| Value | Count | Frequency (%) |
| 2616645 | 1 | |
| 2616639 | 1 | |
| 2616635 | 1 | |
| 2616632 | 1 | |
| 2616623 | 1 | |
| 2616612 | 1 | |
| 2616606 | 1 | |
| 2616588 | 1 | |
| 2616556 | 1 | |
| 2616543 | 1 |
Node
Text
| Distinct | 727 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 10 |
| Mean length | 9.798173 |
| Min length | 4 |
Unique
| Unique | 405 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | node-244 |
|---|---|
| 2nd row | node-244 |
| 3rd row | node-94 |
| 4th row | Interconnect-1N01 |
| 5th row | Interconnect-0T00 |
| Value | Count | Frequency (%) |
| gige7 | 4420 | 10.3% |
| interconnect-0n00 | 3059 | 7.1% |
| interconnect-1n01 | 2960 | 6.9% |
| gige6 | 2132 | 5.0% |
| gige3 | 1457 | 3.4% |
| interconnect-1t01 | 1404 | 3.3% |
| interconnect-1n00 | 1089 | 2.5% |
| gige4 | 971 | 2.3% |
| interconnect-0n03 | 926 | 2.2% |
| interconnect-1n03 | 902 | 2.1% |
| Other values (717) | 23702 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 58444 | |
| e | 55260 | |
| - | 31925 | 7.6% |
| o | 31888 | 7.6% |
| 0 | 26995 | 6.4% |
| t | 26592 | 6.3% |
| c | 26592 | 6.3% |
| 1 | 23240 | 5.5% |
| g | 20188 | 4.8% |
| d | 18610 | 4.4% |
| Other values (20) | 101803 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 421537 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 58444 | |
| e | 55260 | |
| - | 31925 | 7.6% |
| o | 31888 | 7.6% |
| 0 | 26995 | 6.4% |
| t | 26592 | 6.3% |
| c | 26592 | 6.3% |
| 1 | 23240 | 5.5% |
| g | 20188 | 4.8% |
| d | 18610 | 4.4% |
| Other values (20) | 101803 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 421537 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 58444 | |
| e | 55260 | |
| - | 31925 | 7.6% |
| o | 31888 | 7.6% |
| 0 | 26995 | 6.4% |
| t | 26592 | 6.3% |
| c | 26592 | 6.3% |
| 1 | 23240 | 5.5% |
| g | 20188 | 4.8% |
| d | 18610 | 4.4% |
| Other values (20) | 101803 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 421537 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 58444 | |
| e | 55260 | |
| - | 31925 | 7.6% |
| o | 31888 | 7.6% |
| 0 | 26995 | 6.4% |
| t | 26592 | 6.3% |
| c | 26592 | 6.3% |
| 1 | 23240 | 5.5% |
| g | 20188 | 4.8% |
| d | 18610 | 4.4% |
| Other values (20) | 101803 |
Subsystem
Categorical
High correlation 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| switch_module | |
|---|---|
| node | |
| gige | |
| unix.hw | |
| action | |
| Other values (8) |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 7.7144484 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | node |
|---|---|
| 2nd row | node |
| 3rd row | node |
| 4th row | switch_module |
| 5th row | switch_module |
Common Values
| Value | Count | Frequency (%) |
| switch_module | 13278 | |
| node | 11144 | |
| gige | 10094 | |
| unix.hw | 2992 | 7.0% |
| action | 2709 | 6.3% |
| clusterfilesystem | 1579 | 3.7% |
| partition | 581 | 1.4% |
| boot_cmd | 403 | 0.9% |
| domain | 150 | 0.3% |
| shutdown_cmd | 51 | 0.1% |
| Other values (3) | 41 | 0.1% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| switch_module | 13278 | |
| node | 11144 | |
| gige | 10094 | |
| unix.hw | 2992 | 7.0% |
| action | 2709 | 6.3% |
| clusterfilesystem | 1579 | 3.7% |
| partition | 581 | 1.4% |
| boot_cmd | 403 | 0.9% |
| domain | 150 | 0.3% |
| shutdown_cmd | 51 | 0.1% |
| Other values (3) | 41 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 39337 | |
| i | 31965 | 9.6% |
| o | 28719 | 8.7% |
| d | 25086 | 7.6% |
| t | 20798 | 6.3% |
| g | 20188 | 6.1% |
| s | 18103 | 5.5% |
| c | 18025 | 5.4% |
| u | 17900 | 5.4% |
| n | 17628 | 5.3% |
| Other values (14) | 94142 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 331891 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 39337 | |
| i | 31965 | 9.6% |
| o | 28719 | 8.7% |
| d | 25086 | 7.6% |
| t | 20798 | 6.3% |
| g | 20188 | 6.1% |
| s | 18103 | 5.5% |
| c | 18025 | 5.4% |
| u | 17900 | 5.4% |
| n | 17628 | 5.3% |
| Other values (14) | 94142 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 331891 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 39337 | |
| i | 31965 | 9.6% |
| o | 28719 | 8.7% |
| d | 25086 | 7.6% |
| t | 20798 | 6.3% |
| g | 20188 | 6.1% |
| s | 18103 | 5.5% |
| c | 18025 | 5.4% |
| u | 17900 | 5.4% |
| n | 17628 | 5.3% |
| Other values (14) | 94142 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 331891 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 39337 | |
| i | 31965 | 9.6% |
| o | 28719 | 8.7% |
| d | 25086 | 7.6% |
| t | 20798 | 6.3% |
| g | 20188 | 6.1% |
| s | 18103 | 5.5% |
| c | 18025 | 5.4% |
| u | 17900 | 5.4% |
| n | 17628 | 5.3% |
| Other values (14) | 94142 |
Event
Categorical
High correlation 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.0 MiB |
| temperature | |
|---|---|
| error | |
| status | |
| start | |
| net.niff.up | |
| Other values (15) |
Length
| Max length | 28 |
|---|---|
| Median length | 27 |
| Mean length | 8.5757752 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | status |
|---|---|
| 2nd row | temperature |
| 3rd row | status |
| 4th row | error |
| 5th row | bcast-error |
Common Values
| Value | Count | Frequency (%) |
| temperature | 15525 | |
| error | 10583 | |
| status | 6420 | |
| start | 2639 | 6.1% |
| net.niff.up | 2293 | 5.3% |
| fan | 2246 | 5.2% |
| clusterfilesystem.not_served | 834 | 1.9% |
| clusterfilesystem.no_server | 489 | 1.1% |
| bcast-error | 391 | 0.9% |
| state_change.unavailable | 310 | 0.7% |
| Other values (10) | 1292 | 3.0% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| temperature | 15525 | |
| error | 10583 | |
| status | 6420 | |
| start | 2639 | 6.1% |
| net.niff.up | 2293 | 5.3% |
| fan | 2246 | 5.2% |
| clusterfilesystem.not_served | 834 | 1.9% |
| clusterfilesystem.no_server | 489 | 1.1% |
| bcast-error | 391 | 0.9% |
| state_change.unavailable | 310 | 0.7% |
| Other values (10) | 1292 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 69831 | |
| e | 68835 | |
| t | 56804 | |
| a | 30198 | |
| u | 26356 | 7.1% |
| s | 22334 | 6.1% |
| p | 17982 | 4.9% |
| m | 17204 | 4.7% |
| o | 12546 | 3.4% |
| n | 9928 | 2.7% |
| Other values (14) | 36929 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 368947 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 69831 | |
| e | 68835 | |
| t | 56804 | |
| a | 30198 | |
| u | 26356 | 7.1% |
| s | 22334 | 6.1% |
| p | 17982 | 4.9% |
| m | 17204 | 4.7% |
| o | 12546 | 3.4% |
| n | 9928 | 2.7% |
| Other values (14) | 36929 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 368947 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 69831 | |
| e | 68835 | |
| t | 56804 | |
| a | 30198 | |
| u | 26356 | 7.1% |
| s | 22334 | 6.1% |
| p | 17982 | 4.9% |
| m | 17204 | 4.7% |
| o | 12546 | 3.4% |
| n | 9928 | 2.7% |
| Other values (14) | 36929 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 368947 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 69831 | |
| e | 68835 | |
| t | 56804 | |
| a | 30198 | |
| u | 26356 | 7.1% |
| s | 22334 | 6.1% |
| p | 17982 | 4.9% |
| m | 17204 | 4.7% |
| o | 12546 | 3.4% |
| n | 9928 | 2.7% |
| Other values (14) | 36929 |
Timestamp
Date
| Distinct | 36213 |
|---|---|
| Distinct (%) | 84.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 672.2 KiB |
| Minimum | 2003-12-26 12:36:59 |
|---|---|
| Maximum | 2006-04-30 09:58:24 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Histogram with fixed size bins (bins=50)
Unknown
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
| 1 | |
|---|---|
| 0 | 1150 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 41872 | |
| 0 | 1150 | 2.7% |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 41872 | |
| 0 | 1150 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 41872 | |
| 0 | 1150 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 43022 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 41872 | |
| 0 | 1150 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 43022 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 41872 | |
| 0 | 1150 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 43022 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 41872 | |
| 0 | 1150 | 2.7% |
Message
Text
| Distinct | 3792 |
|---|---|
| Distinct (%) | 8.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.8 MiB |
Length
| Max length | 811 |
|---|---|
| Median length | 762 |
| Mean length | 26.602854 |
| Min length | 4 |
Unique
| Unique | 2013 ? |
|---|---|
| Unique (%) | 4.7% |
Sample
| 1st row | running |
|---|---|
| 2nd row | ambient=30 |
| 3rd row | configured out |
| 4th row | Linkerror event interval expired |
| 5th row | Link error |
| Value | Count | Frequency (%) |
| linkerror | 8492 | 5.3% |
| interval | 8492 | 5.3% |
| expired | 8492 | 5.3% |
| event | 8492 | 5.3% |
| 6505 | 4.0% | |
| warning | 5063 | 3.1% |
| network | 4808 | 3.0% |
| normal | 4680 | 2.9% |
| node | 3248 | 2.0% |
| command | 3094 | 1.9% |
| Other values (2252) | 99707 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 125056 | 10.9% |
| 120737 | 10.5% | |
| n | 107316 | 9.4% |
| r | 87619 | 7.7% |
| i | 67255 | 5.9% |
| o | 59554 | 5.2% |
| t | 58815 | 5.1% |
| a | 58652 | 5.1% |
| d | 34355 | 3.0% |
| l | 30867 | 2.7% |
| Other values (57) | 394282 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1144508 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 125056 | 10.9% |
| 120737 | 10.5% | |
| n | 107316 | 9.4% |
| r | 87619 | 7.7% |
| i | 67255 | 5.9% |
| o | 59554 | 5.2% |
| t | 58815 | 5.1% |
| a | 58652 | 5.1% |
| d | 34355 | 3.0% |
| l | 30867 | 2.7% |
| Other values (57) | 394282 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1144508 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 125056 | 10.9% |
| 120737 | 10.5% | |
| n | 107316 | 9.4% |
| r | 87619 | 7.7% |
| i | 67255 | 5.9% |
| o | 59554 | 5.2% |
| t | 58815 | 5.1% |
| a | 58652 | 5.1% |
| d | 34355 | 3.0% |
| l | 30867 | 2.7% |
| Other values (57) | 394282 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1144508 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 125056 | 10.9% |
| 120737 | 10.5% | |
| n | 107316 | 9.4% |
| r | 87619 | 7.7% |
| i | 67255 | 5.9% |
| o | 59554 | 5.2% |
| t | 58815 | 5.1% |
| a | 58652 | 5.1% |
| d | 34355 | 3.0% |
| l | 30867 | 2.7% |
| Other values (57) | 394282 |
Interactions
Correlations
| Event | ID | Subsystem | Unknown | |
|---|---|---|---|---|
| Event | 1.000 | 0.085 | 0.664 | 0.387 |
| ID | 0.085 | 1.000 | 0.114 | 0.000 |
| Subsystem | 0.664 | 0.114 | 1.000 | 0.273 |
| Unknown | 0.387 | 0.000 | 0.273 | 1.000 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Sample
| ID | Node | Subsystem | Event | Timestamp | Unknown | Message | |
|---|---|---|---|---|---|---|---|
| 142250 | 401341 | node-244 | node | status | 2006-03-16 18:21:49 | 0 | running |
| 267028 | 160244 | node-244 | node | temperature | 2005-10-29 07:27:00 | 1 | ambient=30 |
| 120399 | 18237 | node-94 | node | status | 2004-11-18 13:22:00 | 1 | configured out |
| 343592 | 174111 | Interconnect-1N01 | switch_module | error | 2004-02-27 14:43:50 | 1 | Linkerror event interval expired |
| 65565 | 115375 | Interconnect-0T00 | switch_module | bcast-error | 2004-02-26 05:33:25 | 1 | Link error |
| 217808 | 92432 | node-197 | node | temperature | 2005-03-03 18:36:00 | 1 | ambient=31 |
| 83692 | 2596014 | node-109 | node | status | 2004-01-16 21:36:00 | 1 | configured out |
| 148296 | 2606804 | gige7 | gige | temperature | 2004-01-17 15:14:07 | 1 | warning |
| 179027 | 394103 | gige7 | gige | temperature | 2004-06-30 15:32:45 | 1 | warning |
| 130695 | 284681 | node-172 | node | status | 2005-07-06 20:13:37 | 1 | configured out |
| ID | Node | Subsystem | Event | Timestamp | Unknown | Message | |
|---|---|---|---|---|---|---|---|
| 303785 | 2561163 | Interconnect-1N01 | switch_module | error | 2004-01-06 03:35:47 | 1 | Link in reset |
| 151514 | 31411 | gige7 | gige | temperature | 2004-02-09 10:55:59 | 1 | warning |
| 225548 | 140243 | gige4 | gige | temperature | 2005-03-30 10:49:11 | 1 | normal |
| 321379 | 2606943 | Interconnect-1N01 | switch_module | error | 2004-01-17 22:56:12 | 1 | Linkerror event interval expired |
| 352479 | 262162 | Interconnect-1N01 | switch_module | error | 2004-03-14 01:16:39 | 1 | Linkerror event interval expired |
| 256272 | 90975 | gige6 | gige | temperature | 2005-09-30 18:21:41 | 1 | warning |
| 164176 | 316277 | gige7 | gige | temperature | 2004-04-23 18:37:09 | 1 | warning |
| 246194 | 327030 | gige4 | gige | temperature | 2005-08-06 16:58:06 | 1 | warning |
| 35611 | 127486 | Interconnect-0N01 | switch_module | temphigh | 2005-03-26 10:12:00 | 1 | Temperature (42C) exceeds warning threshold |
| 409194 | 139144 | node-18 | unix.hw | net.niff.up | 2004-02-26 15:43:56 | 1 | NIFF: node node-18 has detected an available network connection on network 5.5.226.0 via interface alt0 |
Duplicate rows
Most frequently occurring
| ID | Node | Subsystem | Event | Timestamp | Unknown | Message | # duplicates | |
|---|---|---|---|---|---|---|---|---|
| 32 | 256707 | node-161 | node | status | 2005-06-16 16:02:45 | 0 | running | 3 |
| 89 | 272478 | node-86 | node | status | 2004-03-18 13:08:56 | 0 | not responding | 3 |
| 145 | 413291 | node-238 | node | status | 2004-08-12 14:21:39 | 0 | configured out | 3 |
| 171 | 56340 | node-188 | node | status | 2005-09-15 20:09:30 | 0 | running | 3 |
| 172 | 56375 | node-225 | node | status | 2005-09-15 20:09:30 | 0 | running | 3 |
| 0 | 112848 | node-109 | node | temperature | 2005-10-09 11:25:30 | 1 | ambient=31 | 2 |
| 1 | 163656 | 3910 | boot_cmd | success | 2005-10-30 11:31:17 | 1 | Command has completed successfully | 2 |
| 2 | 24160 | node-226 | node | status | 2004-11-18 14:35:49 | 0 | running | 2 |
| 3 | 251011 | node-72 | node | status | 2005-06-16 12:41:38 | 0 | not responding | 2 |
| 4 | 251070 | node-100 | node | status | 2005-06-16 12:43:30 | 0 | configured out | 2 |